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Abstract —With the help of an in-band full-duplex relay station, 
it is possible to simultaneously transmit and receive signals 
from multiple users. The performance of such system can be 
greatly increased when the relay station is equipped with a 
large number of antennas on both transmitter and receiver 
sides. In this paper, we exploit the use of massive arrays to 
effectively suppress the loopback interference (LI) of a decode- 
and-forward relay (DF) and evaluate the performance of the 
end-to-end (e2e) transmission. This paper assumes imperfect 
channel state information is available at the relay and designs 
a minimum mean-square error (MMSE) filter to mitigate the 
interference. Subsequently, we adopt zero-forcing (ZF) filters 
for both detection and beamforming. The performance of such 
system is evaluated in terms of bit error rate (BER) at both relay 
and destinations, and an optimal choice for the transmission 
power at the relay is shown. We then propose a complexity 
efficient optimal power allocation (OPA) algorithm that, using the 
channel statistics, computes the minimum power that satisfies the 
rate constraints of each pair. The results obtained via simulation 
show that when both MMSE filtering and OPA method are used, 
better values for the energy efficiency are attained. 

Index terms - Decode-and-forward relay, in-band full-duplex, 
massive multiple-input multiple-output (MIMO), MMSE, ZF, 
loopback interference cancellation, optimal power allocation. 

I. Introduction 

In-band full-duplex systems in wireless communications 
have attracted a lot of research interest in the past few 
years. This feature is expected to be implemented in future 
generation networks and gives the possibility of transmitting 
and receiving data at the same time instant and frequency 
band, which means that with the same amount of energy as 
in the half-duplex (HD) counterpart, it is possible to double 
spectral efficiency |lj. However, the inherent complexity of 
such systems has been hindering the deployment of full- 
duplex enabled equipment. The main obstacle to overcome is 
the difficulty in canceling the signal leakage from the relay 
output to its input @ CD Moreover, hardware constraints, 
algorithms’ inefficiency and difficulty in modeling the channel 
have also posed problems, hence the need for further analysis. 

Alongside the developments in full-duplex, a lot of research 
has been conducted recently on the capabilities of multiple- 
input multiple-output (MIMO) systems and its extension to 
very large arrays (dubbed as massive MIMO). The idea in 
future generation systems is to use hundreds of antennas at 
the base station in the hope of serving more users and devices. 
This study involves various signal processing techniques, such 


as channel modeling and estimation, precoding and detection 
algorithms, which must be efficient in terms of complexity 
and perform as close to optimal as possible. 

In this work, we consider a massive MIMO full-duplex 
relay architecture, which uses both of the aforementioned 
state-of-the-art technologies. We assume a group of K sources 
communicating with a group of K destinations. This transmis¬ 
sion is supported by a decode-and-forward (DF) relay station. 

In the first part of this paper, we are interested in loopback 
interference (LI) mitigation techniques for the considered 
setup. We assume that both source to relay and LI channels are 
known (apart from an estimation error) to design a minimum- 
mean square error filter. This is done in (4) for a small 
number of antennas. We show that by exploiting the massive 
MIMO effects and using simple linear processing techniques, 
LI can be effectively attenuated. The results are evaluated in 
terms of end-to-end (e2e) bit error rate (BER) performance. 
Furthermore, an algorithm to find the optimal power allocation 
(OPA) that fulfils the needs of each communication pair in 
terms of achievable rate and peak power is presented. 

II. System Model 

Let us consider K user pairs establish a wireless connection 
through a relay station, which operates in in-band full-duplex 
mode (sharing the same time-frequency resources). For this 
purpose, each user is equipped with a single antenna, while 
the relay has N rx receiving antennas and N tx transmitting 
antennas. The number of antennas at the relay may go up 
to a few hundred and the number of served users is in 
general considered to be much lower (K NrxiNtx)- 
The nonexistence of a direct link between the sources and 
destinations is assumed, hence all links are established via the 
relay station. A simplified representation, as well as the used 
notation to be defined in this section, can be seen in figure [T] 

A. Channel Model 

For the evaluation of the performance of such setup, we are 
interested in the vectors containing the transmitted symbols 
from the sources to the relay and the beamformed symbols 
from the relay to the destinations, denoted by x G C Kxl and 
t G C NtxXl , respectively. These vectors are assumed to be 
taken from an M- QAM constellation and the output signals 
to be normalized and uncorrelated, such that E(xx^) = I and 



Figure 1. Full-duplex relay system. 


E(t^t) = 1. In this manner, the power transmitted by each 
source is independent of K and the total transmitted power at 
the relay dissociated of N tx . The received vectors at the relay 
and at the destinations are given by 0 and 0, respectively. 

r = G SR Dy 2 x + ^/p^H u t + n r . (1) 

Yd = v^Grd* + n d- ( 2 ) 

The uplink symbols in x pass through a MIMO channel 
from the sources to the relay denoted by Gsr G C NrxXK , 
whereas the symbols from the relay to the destinations pass 
through Grd G C KxNtx . Both matrices are expressed as 
Gsr = H S rD^ 2 and G RD = D^ 2 H rd , where D S r and D rd 
are diagonal matrices with entries /3$R,k and /3 rd ,k taken from a 
log-normal distribution, which account for large-scale channel 
effects. The fast-fading channel components are present in 
H sr and H RD , both with independent entries taken from 
a CAT(0,1) distribution. The LI channel is represented by 
Hli with distribution CA/*(0, cr RI ), where cr^i accounts for the 
residual LI power after self-interference suppression imposed 
in the analog-domain. The vectors n r ^ CAT(0,(Jn r ) an d 
n d CA/"(0, cr„ d ) account for additive white Gaussian noise at 
the relay and destinations, respectively. Finally, the diagonal 
matrix D Ps with entries ps,k regulates each source transmit 
power, while pr denotes the relay’s average transmit power. 

B. Channel Estimation 

In order to efficiently apply detection, precoding and LI 
mitigation techniques, channel estimates Hsr, H L i and H RD 
of the true Hsr, Hli and Hrd are required. Applying any 
of the estimation algorithms proposed in the literature 0. 
we assume an error in the channel matrices modelled by 
£h sr ^ ^Hrd , £h li ^ CA/^O, Cpj). The difference between the 
non-ideal estimates and the true channel values are 

Hsr = H sr + £h sr 5 Hli = H LI + £h li ; H RD = H RD + £ Hrd • 

( 3 ) 

No errors in the large-scale fading matrices are assumed. 
The impact of any hardware imperfection at the relay is 
modeled by means of an additive error 0 in the transmitted 
vector given by t = t + £ t , where t is the vector to be 
transmitted after baseband filtering and where all the elements 
of £ t ~ CAT(0, el) are assumed to be uncorrelated with t. It is 


worth mentioning that the covariance of the LI term y/pRH LI t 
in |l]) is controlled by the parameters pr, crfy, el and e^. 


C. Detection and Precoding 

Let us consider for the moment an equivalent received signal 
r, whose LI component has been minimized. Hence, we are in 
the presence of an uplink channel where K « N rx . Under 
this assumption, linear processing techniques are proven to 
perform close to optimal (2). We consider the usage of zero¬ 
forcing (ZF) filtering for both detection and beamforming 0 . 
Using the estimation of Gsr, denoted by Gsr = HsrD^ 2 , 
the estimated symbols after a ZF filter W z f are given by 

X = Q( Wzff) = Q((G s ff R G SR )- 1 G| / R r), (4) 

where Q(-) is a symbol-wise quantizer to the constellation 
set. Upon detection based on x, the estimated symbols are 
forwarded to the destinations y d , which we assume to have 
very limited processing capabilities. Thus, a ZF precoder A z f 
is used. The corresponding expression is 

t = A zf x = a zf G RD (G RD G RD ) -1 x, (5) 

where Grd = D r 7 d Hrd is the estimation of the true Grd 
and a zf = (E(£r{(G RD Gjf D ) -1 })) - 2 is a scalar chosen to 
normalize the power of t, i.e., E(t H t) = 1. From the definition 
of Hrd ~ CJ\f(0 ,1 + Cfj), a z f comes as (cf. appendix A) 


&zf - 


(N tx - K ) 


v -l • 


J2k=l (^RD,fe(l + 6 h)) 

III. Loopback Interference Mitigation 


( 6 ) 


A. Linear Filtering 

We now evaluate the problem of suppressing LI, using linear 
processing. Under the model given by ([!]), we are interested 
in minimizing the LI term y/pRH L it, while preserving the 
signal G sr 2 x and taking into account the covariance of 
the noise vector R nr at the relay. Hence, the mean square error 
between the desired signal and the one received by the relay 
station is E((G S rDps 2 x — r)(G S RDps 2 x — r) H ). Using the 
method proposed in 0 we consider a linear prefilter F tx and 
postfilter F rx , such that t = F tx t and r = F rx r. The filter that 
finds the MMSE when F^ is fixed satisfies (cf. appendix B) 

F rx H LI F tx = 0, (7) 

and when F tx is fixed becomes 

F rx = GsrD Ps G|^(GsrDp s G|^ ApRHLiRtHj^ + R n J 1 , 

( 8 ) 

where R t = F tx A z fRxA^F t ^f+e^I. The solution provided by 
0 corresponds to the null-space projection already presented 
in 0, and in general finding a reliable solution for large 
dimensions is hard. On the contrary, 0 is a closed-form 
expression containing the channel estimates and covariance 
matrices of transmitted vectors and noise, that are assumed to 
be known. Under the considered model and for a given set of 
parameters, a trade-off in the e2e BER is expected: on the one 















































hand, a lower transmitted power at the relay p R reduces the 
LI power and diminishes any impact of the estimation errors 
and radio-frequency impairments; on the other hand, a higher 
Pr leads to improved signal-to-noise ratios (SNR) levels at 
the destinations and hence a lower BER in the forward link 
channel. Thus, for a fixed source transmit power, an optimal 
choice for pr that minimizes the e2e BER can be anticipated. 

B. Optimal Power Allocation 

The level of interference suffered by the relay station 
depends critically on its transmission power and directly 
perturbs the e2e performance. Thus, finding the optimal power 
that meets the requirements of the system is desired. Let 
us consider the achievable rate for each individual link. The 
procedure done in |7] allows us to obtain an expression for 
the transmission link rate between each source and destination 

R k = min{i? S R,/c, ^rd,/c} 5 (9) 

where RsR,k and i? RD ,/c denote the achievable rates between 
the sources and the relay and between the relay and the des¬ 
tinations, respectively. To obtain R k , we start by considering 
the received signal at the relay before detection given by 

Ur,k = V ^(W z fF rx ),gSR,^ fc 

+ VPs,j(W z fF rx ) fe gsRjffj (10) 

+ v / PR( w z fF rx )fcH L it + (W zf F rx )^n r , 

where gsR,/c denotes the k th column of Gsr. The received 
signal at each destination link before detection is given by 

Vd,k =VPRgRD > fc( F txA zf ) fc X ft 

\~^ K T « (11) 
+ VPR / &RD,fc(Ftx-A- z f)j£j + 

Both expressions may be seen as a known mean gain times the 
desired signal plus an uncorrelated effective noise term that 
includes channel impairment effects, interpair and loopback 
interference, and Gaussian noise. A technique commonly used 
in large MIMO systems (8) is to approximate the effective 
noise term by a Gaussian noise term under the central limit 
theorem, and which gives a good approximation for RsR,k as 
in ( p~3] ) and for Rrv^ as in ( p~4] ). 

The goal is to find the system power allocation, i.e., compute 
the required power transmitted by both sources and relay, 
such that the desired rate for each communication pair k 
is guaranteed. Moreover, peak power constraints should be 
satisfied and overall power consumption taken into consider¬ 
ation; in other words, the system energy efficiency, defined 
as EE = X)fcLi Rk/(PR + X)^=iPs,fe)» should be maximized. 
This can be described as the following optimization problem 

miIL ,PS,k+PR, 

PRiPs,i---Ps,k z — J k=l 

S.t. k=l,...,K; (12) 

0<Ps,k<Ps Q}k , k=l,...,K; 

0<pr<pr o , 


where Rq^ and ps o k are the required rate and peak power for 
pair k respectively, and p Ro is the relay station peak power. 
The solution to problem (l2| ) involves the computation of the 
channel statistics which are the output of involved expres¬ 
sions due to the non-linear dependence of the optimization 
variables in F rx . For that reason, we propose a simple and 
computationally efficient iterative algorithm, based on linear 
programming, capable of obtaining the referred OPA. The 
algorithm computes the channel statistics coefficients in ( p~3] ) 
and the loopback mitigation filters and the power vectors 
iteratively until the optimal power vector is reached. The 
procedure is summarized in algorithm [I] 


Algorithm 1 

1. Initialization: Set i = 1; initialize powers ps,k,i = Ps 0jk 
and Pr o = Pr 0 ; define L as the total number of iterations and 
set N it as the number of channel realizations per iteration. 

2. Iteration i: 

1) Compute channel statistics: 

for N it do 

i) Generate G S r, G rd , H u , W zf and A zf 

ii) Compute filter F rx with ps,k,i and p R ^ 

iii) Compute instantaneous rate coefficients for all k pairs: 
MVsr,/c, VsR,fc, MP sr#(M , LI S R,fc and ANsr,/c (as in ( [13] )) 
MVrd,/c, V rd ,/c, MPrd^ and ANrd^ (as in ( [14] )) 

end for 

2) Average to obtain channel statistics. 

3) Solve the linear program ( [12] ) with the coefficients found 
in step 2) to obtain the new ps,k,i and p R ^. 

4) Set ps,k,m = Ps,k,i and p R ,;+i = p R ,*. 

3. Check: If i — L end algorithm, else set i = i + 1. 


IV. Numerical Results 

In this section, we compare the performance of the proposed 
filters with special emphasis on the e2e link reliability and on 
the efficiency of the power allocation scheme. All results are 
obtained via Monte Carlo simulation, using uncoded MIMO. 


A. System Parameters 

We assume a symmetric system with the same number of 
users K on both sides with one antenna each and N = N tx = 
N rx antennas at the relay. Transmission and reception at the 
relay are done in the same time-slot and frequency band, and 
an arbitrary processing delay d > 1 is assumed such that at 
time instant i: x[z] «= /(x[z— d]). We consider F rx as in ^ and 
it loss of generality, we also consider a?, = 1. 


F tx = I. Without: 


The SNR at the relay is defined as SNRr = 


= 1 PsR,kPS,k 


B. Transmitted Power vs. BER 


We start by evaluating the performance of the system in 
terms of BER at both the relay and destinations, considering 






RsR,k — 


log 2 ( 


1 


PS,/cMV S R,/e 


PS,k^SR,k + Z^j//cPS,j M PsR,(/c,j) +PrLIsR,/c +AN S R,fc y 

where MV S r,/c = |E{(W z fF rx )^g S R,/c}| 2 , Vsr,/c = Var{(W z fF rx )^g S R,/c}, MP SR> ( fejJ -) = E{|(W z fF rx )^g S Rj| 2 }, 
LIsr,* = E{ || w z T f?fc F rx H LI F tx A zf || 2 } and AN SR ,/c = a 2 r E{|| (W^F^ || 2 }. 

PrMVrd^ \ 


-)■ 


RRD,k ~ 


i°g 2 ( 


1 + 


PrVrd,/c + PrMPrd^ + ANrd,/c ' 


K 


(13) 


(14) 


where MV RD)fc = |E{g^ fc (F tx A zf ) fc }| 2 , V RD , fc = Var{gj[ D fc (F tx A zf ) fc }, MP RD , fc = J2 E (|gR D ^F^Arf)^ 2 } and AN RD ,fc = °l d - 


only small-scale fading, that is D^ 2 = D^ 2 = I. Firstly, for a 
given SNRr and fixed uniform transmitted power ps,k = 1 for 
all k, we are interested in studying different allocated powers 
at the relay p R and for an increasing number of antennas N. 
For comparison purposes, we implemented natural isolation 
(NI) (when F rx = F tx = I, which corresponds to ignoring 
the LI component) and the HD counterpart (when t = 0 and 
F rx = Ftx = I). Setting the variance in the errors to be e 2 = 
e H = 10- 3 , curves of BER for different numbers of antennas 
and relay power p R are depicted in figure [2] It can be seen 
that for an increasing number of antennas, the system using 
the MMSE filter becomes more robust to the LI, showing a 
BER performance closer to the HD for higher values of power 
Pr than when compared with NI. We then evaluate the e2e 
BER in terms of the allocated power pr and using the same 
setup as before. For this purpose, we set the variance of the 
noise at the destinations cr 2 d , quantize the received signals in 
([2]) and compare them with the symbols transmitted by the 
sources. The results are shown in figure [3] One can confirm 
that for a given configuration there is an optimal choice for 
the power at the relay that minimizes the e2e BER. Moreover, 
it is concluded that a larger number of antennas attains the 
minimum BER with less power. 

C. Optimal Power Allocation Algorithm 

Finally, we evaluate the results of the proposed algorithm in 
section |IlLB| finding the OPA that meets the rate constraints of 
each link, while minimizing the overall power of the system. 
Large-scale fading effects are now taken into consideration, 
more precisely, PsR,k and /3rd, k are independent variables 
generated from a log-normal distribution with mean value 
mm 1 and standard deviation cr = 6 dB. Additionally, we 
set the normalized peak power of the sources and relay as 
Ps 0 k = 3 dB and pr 0 = 10 dB, respectively. In addition, 
parameters N it = 10 3 and L = 5 are empirically defined, such 
that both channel statistics and output powers in algorithm [T] 
are good approximations of their real value. The performance 
of algorithm [I] is determined in terms of EE for the case 
where interference effects are disregarded (OPA-NI) and when 
the MMSE filter is used (OPA-MMSE). This method is also 
compared with the situation where an optimal uniform power 
allocation (OUPA) is used, which corresponds to consider 
Ps,k = Ps, for all k, in <[T2 ]i. Figure [4] shows the curves 
of average EE for different values of desired e2e sum-rate, 



Figure 2. BER performance at the relay for different numbers of antennas 
N, K = 5 pairs, SNRr = 8 dB and 16-QAM. 



Figure 3. End-to-end BER performance for different N and K = h 
pairs, SNRr = 8 dB and 16-QAM. 

defined as ^ k=1 Ro,k> where the individual required rates 
Ro,k are taken from a discrete uniform distribution. 

As one may see, for the same desired sum-rate, the EE 
of OPA-MMSE is improved significantly when compared 
to OUPA, while guaranteeing that no link is in outage 
(R k > Ro,k for all k ). Furthermore, since MMSE filter ef¬ 
fectively reduces the LI, a lower amount of energy is spent to 
achieve the same sum-rate when compared with OPA-NI, an 
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Lemma 2.10] that 



Figure 4. Energy efficiency for different power allocation schemes, filters 
and number of antennas. The remaining parameters are = 1, K = 10 
pairs and SNRr = 16 dB. 


effect that becomes more preponderant in higher rates regimes. 
V. Conclusions 

This paper studies an independent single-antenna multipair 
communication system using an in-band full-duplex relay 
station equipped with a large number of antennas. Considering 
imperfect channel state information, the relay applies two 
stages of filtering. Firstly, inner ZF filters are used in order to 
perform both detection and beamforming. Secondly, an MMSE 
outer filter is designed to mitigate LI, while preserving the e2e 
channel. We assess the effectiveness of the MMSE filter via 
simulation in terms of BER at the relay and comparing it with 
NI and the HD counterpart. Moreover, in the e2e BER analysis, 
an optimal choice for the power at the relay was identified. 
We further confirmed the low complexity, high performance 
capabilities of linear processing in systems employing massive 
MIMO. We provide the expressions for the e2e achievable rate 
of the studied setup and propose a low complexity iterative 
algorithm to perform OPA. The latter takes into consideration 
large-scale fading effects, peak power constraints and the rate 
requirements for each individual pair. The results were evalu¬ 
ated in terms of EE, and the system employing MMSE filtering 
outperformed both NI and OUPA, specially for high sum-rates. 

Appendix 

A. Normalization factor a z f 

The objective is to design a z f such that Ejt^t} = 1. Using 
its definition t = <TzfG^(GRDG^)) _1 x = a z fPx, and the 
fact that C = E{xx^} = I, it follows that 

E{t^t} = a z 2 f E{(Px) ff (Px)} = a 2 zf tr{PCP H } = 1. (15) 

Exploiting C = I, it follows that tr {PCP H } is given by 
^{(GrdG^)) -1 } = ^{(DrdHrdH^) -1 }. Now noting 
that W = HrdH^ d is a central Wishart matrix, where the 
columns of Hrd G C KxNtx are zero-mean complex Gaussian 
vectors with covariance matrix (1 + e^)I, it follows from |9[ 


E^DrdW})- 1 } = 


Sfc=l (/?RD,k(l + Ch)) 

N tx - K 


( 16 ) 


Merging © and ( [T6| ), the factor a zf in El follows. 

B. F tx and F rx expressions 

The error covariance matrix Q of the relay input signal is 


(17) 


given by E((G SR x - r)(G SR x - r) H ), yielding |4j 
Q =(I - F rx )G SR D Ps Gf R (I - F ra ) ff 


- p R F lx H L iR/H^F^ 


F P 

1 rx A Mi r r rx i 


where R t = + Hg t = E(tt^) + E(£ t £^) = 

F tx A zf RxA^F t ^ + e 2 l. Using the fact that the function 
/ = £r{ZAoZ^Ai} has derivative given by fzcf = AiZAq 


Table 4], one may easily find -^tr{ Q} = 0 and 

tx 

^p^£r{Q} = 0. Fixing F tx comes 


d 

3F* 


tr{ Q} =F rx (p R H L iR t H£ / I + R n 


(18) 


— (I — Frx)Gs R D Ps G S R — 0, 
which, when solved, gives 0- Fixing F rx leads to 

d 

dF; 


-tr{Q} =D K F rx H LI (^ R F; x ' HLiF tx A 2/ RxA : 


(19) 


—D ? , s F,. x H L i(F rx H L |F tx )A zt R x A zf — 0, 


whose solution is given by 0. 
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